RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/030,330



DATE: 02/08/2002 
TIME: 11:11:46 



Input Set : A:\sequence.txt.app 

Output Set: N:\CRF3\02082002\J030330.raw



ENTERED 



3 <110> APPLICANT: The University of Georgia Research Foundation, Inc.

5 <120> TITLE OF INVENTION: A POLYPEPTIDE HAVING AMIDOLYTIC ACTIVITY FOR A SERPIN 

7 <130> FILE REFERENCE: 235.00210201 



C--> 9 <140> CURRENT APPLICATION NUMBER: US/10/030,330

C--> 10 <141> CURRENT FILING DATE: 2001-10-19

12 <150> PRIOR APPLICATION NUMBER: 60/130,436

13 <151> PRIOR FILING DATE: 1999-04-21

15 <160> NUMBER OF SEQ ID NOS: 6

17 <170> SOFTWARE: PatentIn Ver. 2.1

19 <210> SEQ ID NO: 1

20 <211> LENGTH: 843

21 <212> TYPE: PRT

22 <213> ORGANISM: Porphyromonas gingivalis

24 <400> SEQUENCE: 1
























25 Met Lys Lys Ser Phe Leu Leu Ala Ile Val Met Leu Phe Gly Ile Ala

26 1 5 10 15

28 Met Gln Gly His Ser Ala Pro Val Thr Lys Glu Arg Ala Leu Ser Leu

29 20 25 30

31 Ala Arg Leu Ala Leu Arg Gln Val Ser Leu Arg Met Gly Gln Thr Ala

32 35 40 45

34 Val Ser Asp Lys Ile Ser Ile Asp Tyr Val Tyr Arg Gln Gly Asp Ala

35 50 55 60

37 Glu Arg Gly Ile Thr Ser Gln Glu Glu Gly Ser Pro Ala Tyr Phe Tyr

38 65 70 75 80

40 Val Ala Asn Arg Gly Asn Asn Glu Gly Tyr Ala Leu Val Ala Ala Asp

41 85 90 95

43 Asp Arg Ile Pro Thr Ile Leu Ala Tyr Ser Pro Ile Gly Arg Phe Asp

44 100 105 110






46 Met Asp Ser Met Pro Asp Asn Leu Arg Met Trp Leu Gln Ile Tyr Asp

47 115 120 125

49 Gln Glu Ile Gly Leu Ile Leu Ser Gly Lys Ala Gln Leu Asn Glu Glu

50 130 135 140

52 Ile Leu Arg Thr Glu Gly Val Pro Ala Glu Val His Ala Leu Met Asp

53 145 150 155 160

55 Asn Gly His Phe Ala Asn Asp Pro Met Arg Trp Asn Gln Gly Tyr Pro

56 165 170 175

58 Trp Asn Asn Lys Glu Pro Leu Leu Pro Asn Gly Asn His Ala Tyr Thr

59 180 185 190

61 Gly Cys Val Ala Thr Ala Ala Ala Gln Ile Met Arg Tyr His Ser Trp

62 195 200 205

64 Pro Leu Gln Gly Glu Gly Ser Phe Asp Tyr His Ala Gly Ser Leu Val

65 210 215 220

67 Gly Asn Trp Ser Gly Thr Phe Gly Glu Met Tyr Asp Trp Ile Asn Met



68 225 230 235 240

70 Pro Gly Asn Pro Asp Leu Asp Asn Leu Thr Gln Ser Gln Val Asp Ala

71 245 250 255

73 Tyr Ala Thr Leu Met Arg Asp Val Ser Ala Ser Val Ser Met Ser Phe

74 260 265 270

76 Tyr Glu Asn Gly Ser Gly Thr Tyr Ser Val Tyr Val Val Gly Ala Leu

77 275 280 285

79 Arg Asn Asn Phe Arg Tyr Lys Arg Ser Leu Gln Leu His Val Arg Ala

80 290 295 300

82 Leu Tyr Thr Ser Gln Glu Trp His Asp Met Ile Arg Gly Glu Leu Ala

83 305 310 315 320

85 Ser Gly Arg Pro Val Tyr Tyr Ala Gly Asn Asn Gln Ser Ile Gly His

86 325 330 335

88 Ala Phe Val Cys Asp Gly Tyr Ala Ser Asp Gly Thr Phe His Phe Asn

89 340 345 350

91 Trp Gly Trp Gly Gly Val Ser Asn Gly Phe Tyr Lys Leu Thr Leu Leu

92 355 360 365

94 Ser Pro Thr Ser Leu Gly Ile Gly Gly Glu Gly Ile Gly Phe Thr Ile

95 370 375 380

97 Tyr Gln Glu Ile Ile Thr Gly Ile Glu Pro Ala Lys Thr Pro Ala Glu

98 385 390 395 400


100 Ala Gly Thr Asp Ala Leu Pro Ile Leu Ala Leu Lys Asp Ile Glu Ala

101 405 410 415

103 Glu Tyr Lys Ser Glu Ser Gly Leu Asn Val Gly Tyr Ser Ile Tyr Asn

104 420 425 430

106 Thr Gly Glu Glu Gln Ser Asn Leu Asp Leu Gly Tyr Arg Leu Asn Lys

107 435 440 445

109 Ala Asp Gly Glu Val Ile Glu Val Lys Thr Ser Ser Ile Asn Ile Ser

110 450 455 460

112 Trp Tyr Gly Tyr Gly Glu His Pro Glu Ser Phe Ser Leu Ala Pro Asn

113 465 470 475 480

115 Gln Leu Ser Gln Gly Ile Asn Thr Ile Thr Leu Leu Tyr Arg Arg Thr

116 485 490 495

118 Gly Thr Glu Gln Trp Glu Pro Val Arg His Ala Gln Gly Gly Tyr Val

119 500 505 510






121 Asn Ser Ile Lys Val Asn Thr Thr Asp Pro Asn Asn Val Val Val Thr

122 515 520 525

124 Val Asp Asn Asn Glu Gly Lys Leu Ser Ile Val Pro Asn Ser Phe Val

125 530 535 540

127 Ala Asp Leu Asn Ser Tyr Glu His Ser Thr Ile Thr Val Gln Phe Asn

128 545 550 555 560

130 Ser Asp Ser Pro Asp Glu Ile Arg Thr Pro Val Ala Phe Ala Leu Ser

131 565 570 575

133 Thr Gly Ala Thr Ala Asp Asp Val Ile Ser Leu Gly Trp Val Met Ala

134 580 585 590

136 Glu Val Pro Gly Gly Ser Ser Asn Tyr Pro Val Val Trp Ser Lys Asp

137 595 600 605

139 Val Leu Thr Leu Ser Glu Gly Asp Tyr Thr Leu Trp Tyr Arg Phe Ser

140 610 615 620

142 Ile Asn Asn Gln Lys Asp Glu Trp Lys Lys Ile Gly Ser Val Ser Val

143 625 630 635 640

145 Lys Thr Pro Thr Glu Tyr Thr His Pro Leu Phe Glu Val Gly His Asn

146 645 650 655

148 Gln Thr Ser Thr Tyr Thr Leu Asp Met Ala His Asn Arg Val Leu Pro

149 660 665 670

151 Asp Phe Thr Leu Lys Asn Leu Gly Leu Pro Phe Asn Gly Glu Leu Val

152 675 680 685

154 Val Val Phe Arg Gln Thr Gln Ser Ser Ser Gly Ser Leu Trp Ala Ala

155 690 695 700

157 Gln Glu Thr Val His Ile Lys Gln Gly Glu Thr Phe Val Tyr Lys Pro

158 705 710 715 720

160 Val Val Glu Gly Pro Ile Pro Asp Gly Ser Tyr Arg Ala Thr Leu His

161 725 730 735

163 Ala Phe Val Asn Gly Gln Gln Gln Leu Tyr Leu Lys Gly Lys Arg Asn

164 740 745 750






166 Tyr Thr Val Lys Ile Val Asn Gly Thr Ala Val Glu Ala Ile Glu Ser

167 755 760 765

169 Ser Glu Glu Ile Arg Val Phe Pro Asn Pro Ala Arg Asp Tyr Val Glu

170 770 775 780

172 Ile Ser Ala Pro Cys Ile Pro Gln Glu Thr Ser Ile Ile Leu Phe Asp

173 785 790 795 800

175 Leu Ser Gly Lys Ile Val Met Lys Asn Ser Leu Ser Ala Gly His Gly

176 805 810 815

178 Arg Met Asp Val Ser Arg Leu Pro Asn Gly Ala Tyr Ile Leu Lys Val

179 820 825 830

181 Asp Gly Tyr Thr Thr Lys Ile Asn Ile Val His

182 835 840

185 <210> SEQ ID NO: 2 

186 <211> LENGTH: 2532 

187 <212> TYPE: DNA 

188 <213> ORGANISM: Porphyromonas gingivalis 

190 <400> SEQUENCE: 2 

191 atgaaaaaaa gttttctttt agccatagta atgctctttg gcattgccat gcagggacat 60 

192 tctgctccgg ttacgaaaga gcgagctttg agtctggctc ggctggcttt gcgacaggta 120 

193 tccttgcgaa tgggacaaac agcagtatct gacaagattt ccatcgatta cgtttatcgg 180 

194 caaggagatg ctgagagggg tatcacatca caagaggaag gctctcctgc atatttttat 240 

195 gtagctaatc gtggaaataa tgagggctat gctcttgtag cagcagatga cagaataccg 300 

196 acaattttag cctattcacc cattggccgt ttcgacatgg acagtatgcc ggacaatctt 360 

197 cgcatgtggc tacaaattta cgatcaggaa ataggcctga tactttccgg aaaagctcag 420 

198 ctcaatgaag agatattacg taccgagggc gtaccggctg aagtacatgc tctgatggat 480 

199 aacggtcatt ttgccaacga tcccatgcga tggaatcaag gttacccatg gaacaataag 540 

200 gaaccactgc ttcctaatgg caatcatgcc tataccggct gtgttgctac tgctgcagca 600 

201 caaatcatgc gctaccatag ctggccgctt caaggtgaag gctctttcga ttatcatgca 660 

202 ggttcattag ttggcaactg gtccggcaca tttggtgaaa tgtacgactg gatcaatatg 720 

203 cccggaaatc ccgaccttga taatctgact caatctcaag tggatgccta cgccacactg 780 

204 atgcgtgatg tgagtgcatc tgtttcgatg agtttttatg aaaatggaag tggtacgtac 840 

205 agcgtttatg tagtaggagc cttgcgaaac aactttcgct acaagcgttc actgcagcta 900 

206 catgtacgcg ccttatatac ctcacaggag tggcacgata tgatccgcgg ggaacttgcc 960 

207 tccggaaggc cggtctatta tgcagggaat aaccagagca taggacatgc tttcgtttgc 1020

208 gatggttatg cttcggatgg tactttccat ttcaactggg gttggggagg tgtttccaac 1080

209 ggcttctaca aactaacact cctctcgccg acttcgttgg gtatcggagg tgagggaata 1140

210 ggttttacca tttatcaaga gatcatcacc ggtatcgaac cggctaagac tcccgctgaa 1200

211 gccggtacag atgccttgcc gatcttggca ctgaaagaca tagaagccga gtataaaagt 1260

212 gaatccggat tgaacgtagg gtattcgata tataatacag gtgaagagca atcaaatctt 1320

213 gacctcggat acagattgaa caaggctgac ggagaagtca tagaggtgaa aacttcatct 1380

214 atcaatatct cttggtacgg atacggagag catcccgaga gtttctcatt ggcacctaat 1440

215 cagttgtcac aaggaatcaa caccatcacc ctactttatc gtcgcacagg caccgaacag 1500

216 tgggagccgg tacggcatgc acagggagga tatgtcaata gcattaaagt aaatacgaca 1560

217 gacccgaaca atgtcgtagt cacggtagat aataacgaag gcaagctcag tatcgtcccc 1620

218 aacagctttg tcgcagatct gaattcttat gaacatagta cgattacagt acagttcaat 1680

219 agcgacagcc ctgatgagat ccgtacaccc gtagcctttg ctctatctac aggagctact 1740

220 gcggacgatg taatatcttt gggctgggta atggctgaag ttccgggcgg tagcagcaac 1800

221 tatccggtgg tttggtctaa agacgttctc actctctcgg aaggcgacta tacattgtgg 1860

222 tatagatttt ccatcaacaa ccaaaaggat gaatggaaaa agatcggaag cgtgtcagta 1920

223 aaaacaccga cagagtatac gcacccctta ttcgaagtgg gccataatca aacttctacc 1980

224 tatacgctgg atatggcaca caacagagta ttgcccgact ttacactcaa aaatctcgga 2040

225 ttgcctttca atggtgagtt ggttgttgtt ttccgccaaa cacaatcctc atcggggtct 2100

226 ttatgggcag ctcaagaaac agtacatatc aagcaaggag aaactttcgt atataaacct 2160

227 gttgtcgaag gccctatacc tgatggatcc tatcgtgcga ccctccatgc attcgtaaac 2220

228 ggacaacaac agttgtacct caaggggaaa aggaactaca cggtgaagat cgtcaatggt 2280

229 acagcggtag aagcaataga atcgtcagaa gagatcagag tattccctaa tccggcacgc 2340

230 gattatgtgg aaatatcggc accttgcatt ccccaagaaa catctatcat tcttttcgat 2400

231 ctgtcaggca agattgtcat gaagaatagt ttatcagcgg ggcatggcag aatggatgtc 2460

232 agccgacttc ctaatggggc ctacatcctt aaggtggatg gatatacgac gaaaataaat 2520

233 atagtgcact aa 2532

236 <210> SEQ ID NO: 3

237 <211> LENGTH: 418

238 <212> TYPE: PRT

239 <213> ORGANISM: Homo sapiens

241 <400> SEQUENCE: 3

242 Met Pro Ser Ser Val Ser Trp Gly Ile Leu Leu Leu Ala Gly Leu Cys

243 1 5 10 15

245 Cys Leu Val Pro Val Ser Leu Ala Glu Asp Pro Gln Gly Asp Ala Ala

246 20 25 30

248 Gln Lys Thr Asp Thr Ser His His Asp Gln Asp His Pro Thr Phe Asn

249 35 40 45

251 Lys Ile Thr Pro Asn Leu Ala Glu Phe Ala Phe Ser Leu Tyr Arg Gln

252 50 55 60

254 Leu Ala His Gln Ser Asn Ser Thr Asn Ile Phe Phe Ser Pro Val Ser

255 65 70 75 80

257 Ile Ala Thr Ala Phe Ala Met Leu Ser Leu Gly Thr Lys Ala Asp Thr

258 85 90 95

260 His Asp Glu Ile Leu Glu Gly Leu Asn Phe Asn Leu Thr Glu Ile Pro

261 100 105 110

263 Glu Ala Gln Ile His Glu Gly Phe Gln Glu Leu Leu Arg Thr Leu Asn

264 115 120 125

266 Gln Pro Asp Ser Gln Leu Gln Leu Thr Thr Gly Asn Gly Leu Phe Leu

267 130 135 140

269 Ser Glu Gly Leu Lys Leu Val Asp Lys Phe Leu Glu Asp Val Lys Lys

270 145 150 155 160

272 Leu Tyr His Ser Glu Ala Phe Thr Val Asn Phe Gly Asp Thr Glu Glu

273 165 170 175

275 Ala Lys Lys Gln Ile Asn Asp Tyr Val Glu Lys Gly Thr Gln Gly Lys

276 180 185 190

278 Ile Val Asp Leu Val Lys Glu Leu Asp Arg Asp Thr Val Phe Ala Leu

279 195 200 205

281 Val Asn Tyr Ile Phe Phe Lys Gly Lys Trp Glu Arg Pro Phe Glu Val

282 210 215 220

284 Lys Asp Thr Glu Glu Glu Asp Phe His Val Asp Gln Val Thr Thr Val

285 225 230 235 240

287 Lys Val Pro Met Met Lys Arg Leu Gly Met Phe Asn Ile Gln His Cys

288 245 250 255

290 Lys Lys Leu Ser Ser Trp Val Leu Leu Met Lys Tyr Leu Gly Asn Ala

291 260 265 270

293 Thr Ala Ile Phe Phe Leu Pro Asp Glu Gly Lys Leu Gln His Leu Glu

294 275 280 285

296 Asn Glu Leu Thr His Asp Ile Ile Thr Lys Phe Leu Glu Asn Glu Asp

297 290 295 300

299 Arg Arg Ser Ala Ser Leu His Leu Pro Lys Leu Ser Ile Thr Gly Thr

300 305 310 315 320

302 Tyr Asp Leu Lys Ser Val Leu Gly Gln Leu Gly Ile Thr Lys Val Phe

303 325 330 335

305 Ser Asn Gly Ala Asp Leu Ser Gly Val Thr Glu Glu Ala Pro Leu Lys

306 340 345 350

308 Leu Ser Lys Ala Val His Lys Ala Val Leu Thr Ile Asp Glu Lys Gly

309 355 360 365

311 Thr Glu Ala Ala Gly Ala Met Phe Leu Glu Ala Ile Pro Met Ser Ile

312 370 375 380

314 Pro Pro Glu Val Lys Phe Asn Lys Pro Phe Val Phe Leu Met Ile Glu

315 385 390 395 400

317 Gln Asn Thr Lys Ser Pro Leu Phe Met Gly Lys Val Val Asn Pro Thr

318 405 410 415

320 Gln Lys

324 <210> SEQ ID NO: 4

325 <211> LENGTH: 15

326 <212> TYPE: PRT

327 <213> ORGANISM: Homo sapiens

329 <400> SEQUENCE: 4

330 Gly Ala Met Phe Leu Glu Ala Ile Pro Met Ser Ile Pro Pro Glu

331 1 5 10 15

334 <210> SEQ ID NO: 5

335 <211> LENGTH: 840

336 <212> TYPE: PRT

337 <213> ORGANISM: Porphyromonas gingivalis

339 <400> SEQUENCE: 5

340 Met Lys Arg Ile Phe Tyr Thr Leu Gly Leu Leu Leu Leu Cys Leu Pro